Discovering Relations among Named Entities from Large Corpora
Abstract
Discovering the significant relations embedded in documents would be very useful not only for information retrieval but also for question answering and summarization. Prior methods for relation discovery, however, needed large annotated corpora, which cost a great deal of time and effort to build. We propose an unsupervised method for relation discovery from large corpora. The key idea is to cluster pairs of named entities according to the similarity of the context words intervening between the named entities. Our experiments using one year of newspapers reveal not only that the relations among named entities could be detected with high recall and precision, but also that appropriate labels could be automatically provided for the relations.
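The key idea in the abstract can be illustrated with a minimal sketch. The abstract only states that entity pairs are clustered by the similarity of their intervening context words; everything else here is an assumption made for illustration: the bag-of-words representation, cosine similarity, complete-linkage merging, the threshold value, the tiny stopword list, the labeling rule (most frequent context word in a cluster), the function names, and the toy data.

```python
import math
from collections import Counter
from itertools import combinations

# Hypothetical stopword list; a real system would use a fuller one.
STOPWORDS = {"of", "to", "was", "will", "the", "a"}

# Toy input (invented): (entity 1, entity 2, words found between them).
MENTIONS = [
    ("Acme", "Widgets Inc", "agreed to acquire"),
    ("Globex", "Initech", "will acquire"),
    ("Alice Smith", "Acme", "was named president of"),
    ("Bob Jones", "Globex", "president of"),
]

def context_vectors(mentions):
    """Accumulate a bag of intervening words for each entity pair."""
    vectors = {}
    for e1, e2, between in mentions:
        words = [w for w in between.lower().split() if w not in STOPWORDS]
        vectors.setdefault((e1, e2), Counter()).update(words)
    return vectors

def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def cluster_pairs(vectors, threshold=0.3):
    """Agglomerative clustering with complete linkage (an assumed choice).

    Repeatedly merges the two clusters whose least similar members are
    still more similar than the threshold; stops when no such pair exists.
    """
    clusters = [[pair] for pair in vectors]
    while True:
        best, best_sim = None, threshold
        for i, j in combinations(range(len(clusters)), 2):
            sim = min(cosine(vectors[p], vectors[q])
                      for p in clusters[i] for q in clusters[j])
            if sim > best_sim:
                best, best_sim = (i, j), sim
        if best is None:
            return clusters
        i, j = best  # i < j, so deleting j leaves index i valid
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]

def label(cluster, vectors):
    """Label a cluster with its most frequent context word."""
    total = Counter()
    for pair in cluster:
        total.update(vectors[pair])
    return total.most_common(1)[0][0]

if __name__ == "__main__":
    vecs = context_vectors(MENTIONS)
    for members in cluster_pairs(vecs):
        print(label(members, vecs), members)
```

On the toy data, the two acquisition pairs end up in one cluster and the two president pairs in another, each labeled by the context word its members share. The threshold controls how aggressively pairs are merged and would need tuning on real data.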
Similar resources
Named Entity Relation Mining using Wikipedia
Discovering relations among Named Entities (NEs) from large corpora is both a challenging and a useful task in the domain of Natural Language Processing, with applications in Information Retrieval (IR), Summarization (SUM), Question Answering (QA) and Textual Entailment (TE). The work we present resulted from the attempt to solve practical issues we were confronted with while building sys...
Discovering Relations among Named Entities by Detecting Community Structure
This paper proposes a networked data mining method for relation discovery from large corpora. The key idea is to represent the named entity pairs and their contexts as a network and to detect the communities in that network. Each community then corresponds to a relation: the named entity pairs in the same community share the same relation. Finally, we labeled the relations. Our exper...
Semantically-Driven Extraction of Relations between Named Entities
In this paper, we describe a method that automatically generates lexico-syntactic patterns which are then used to extract semantic relations between named entities. The method uses a small set of seeds, i.e. named entities that are a priori known to be in relation. This information can easily be extracted from encyclopedias or existing databases. From very large corpora we extract sentences tha...
RelANE: Discovering Relations between Arabic Named Entities
In this paper, we describe the first tool that detects the semantic relation between Arabic named entities, henceforth RelANE. We use various supervised learning techniques to predict the word or the sequence of terms that can highlight one or more semantic relationships between two Arabic named entities. For each word in the sentence, we use its morphological, contextual and semantic features o...
Relation Extraction with Massive Seed and Large Corpora
The research area of information extraction (IE) aims to extract relevant structured information from natural language texts. In addition to the named-entity recognition (NER) task, the identification and classification of relations among entities, namely, the so-called relation extraction (RE) task, is particularly important for many real-world applications. Given the sentence in Figure 1, a R...